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intelligence and library searching. Furthermore, a new algorithm has been devised 
to combine the results from the library searching and the PAIRS program to obtain 
high confidence level interpretational data. Major functionalities of the PC 
PAIRS* are illustrated in Figure 1. 



SUMMARY 

A turn-key hazardous wastes analysis system has been developed using Fourier 
transform attenuated total reflectance infrared spectrometry, artificial intelligence 
assisted library searching and an automated infrared interpreter. Both the library 
searching and the infrared interpreter take advantage of the portability, the comput- 
ing power and the mass data storage available for PC systems and is demonstrated 
to be a cost effective tool to assure consistent data interpretation. This turn-key 
system has a 10 nanogram detection limit and provides fast sample turn around 
time. The possibility of direct sample analysis without any clean-up process is 
demonstrated. Preliminary results from this developmental work indicate the 
possible improvement of sample throughput by as much as 300%. 



INTRODUCTION 

This automated infrared interpreter was developed on an IBM personal com- 
puter (PC) running under the Microsoft disk operating system (DOS). Based on the 
original Merck Sharp & Dhome Research Laboratory Program for the Analysis of 
Infrared Spectra (PAIRS) (1,2). this infrared interpreter, PC PAIRS*, is capable of 
analyzing infrared spectra measured with a wide variety of spectrophotometers. 
Modifications have been made to PAIRS to enable the application of both artificial 
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Figure 1. Functionalities of PC PAIRS* 



All software was developed on a Magus Data Technology (Richmond Hill, 
Onatrio) 80386/80387 IBM DOS PC running at a clock speed of 20 MHz. The 
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original PAIRS and the infrared interpretation rule generator, the Computer Ori- 
ented Notation Concerning Infrared Spectral Evaluation (CONCISE) (3,4) lan- 
guage, were obtained from the Quantum Chemistry Program Exchange 
(Bloomington, Indiana, Catalog # QCPE 426). The programs were downloaded 
into the PC via a RS-232 line from a VAX 780 system. Both programs were modi- 
fied and converted to standard Ryan McFarland FORTRAN (Phoenic Computer 
Products, Norwood, MA) conventions, compiled and run on the same PC. 

The library searching subprogram used five search algorithms. They are: 
absolute difference (AB), square of absolute difference (SQ), absolute difference 
of first derivative (AD), square of difference of first derivative (SD) and time 
domain cross correlation (FT). One additional algorithm was developed as part of 
this project to combine the library searching results from the first five algorithms, 
the combined algorithm (CO). 

EXPERIMENTAL 

A. Chemicals and Melhod of Analysis. A total of 67 oil and grease samples were 
measured in this work. These samples are standards of the most commonly 
encountered petroleum hydrocarbons and greases in the province of Ontario. A 
Barnes (SpectraTech Inc., Stamford, CT) micro circle cell accessory was used to 
measure the attenuated total reflectance (ATR) spectra of the sar .pies. These spec- 
tra then formed the MOE Oil & Grease Library. 




Several ATR infrared spectra were generated to test the software and the Oil 
& Grease library. Typically, about 2 ml of oil samples were used to spike one litre 
of up water. Using the standard MOE method for the solvent extractable (5) the 
oils was extracted from the water sample using dichloromethane. Upon evapora- 
tion of the solvent the infrared spectrum of the residue was measured 

B. Instrumentation. Either a BOMEM MB-100 (Quebec. Quebec) or a Nicolet 
5SX (Madison, WI) Fourier transform infrared (FT-IR) spectrometer was used to 
measure the spectra. An optical retardation of 0.25 cm was used to signal average 
100 single sided interferograms with a resolution of 2 cm 1 . The BOMEM FT-IR 
used an IBM DOS PC computer for the data collection and manipulation. For the 
Nicolet 5SX FT-IR, a Nicolet 620 data system was used to serve the purposes. 
Spectra measured with the BOMEM FT-IR were used without any change. For 
spectra measured on the Nicolet FT-IR a NICOMPC communication software (Ni- 
colet Inc., Madison, WI) was used to download the spectra into the PC via a RS- 
232 line. The downloaded spectra were then used to create the spectral library. 



RESULTS AND DISCUSSIONS 

Figure 2 illustrates the process employed by PC PAIRS' in the evaluation of 

unknown spectra and repot generation. Library searching was used for the posi- 
identification of unknowns. In order to minimize the effect of noise and base- 
problems of the infrared spectrum, results from all five library searching 
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algorithms (AB. SQ, AD, SD, CO) were combined to determine the best match for 
the unknown. Table 1 illustrates a typical library searching report from this turn- 
key system. The test sample used was IOSOL 1520, a common paint thinner used 
for the oil based paint. The library searching routine was set lo list the five best 
matches from each individual routine. From Table 1, the AB and the SQ algo- 
rithms fail to identify the unknown from the Oil & Grease library. The CO method, 
on the other hand, successfully identified this unknown and many others, 
supporting its use as the method of choice for the library searching. 



Measurement of ATR Infrared spectrum 



Downloading spectral data 
Into MS/DOS PC 
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Figure 2. Procedures for a typical unknown sample 
analysis using the turn-key system. 
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Although the oil library with the five search algorithms have been successful 
for the identification of many routine sample analysis, we have encountered some 
situations where PAIRS* proved to be superior. To illustrate mis we added digital 
noise to a spectrum of Turbo A-l oil to simulate the spectrum of an unknown 
organic extracted with dichloromethane. This spectrum was then searched through 
our oil library using CO algorithm. The result is shown in Figure 3. From this 
Figure, it is clear that all three matches and the test spectrum have identical spectra 
which makes the identification of the test spectrum from the library searching 
results impossible. Using PAIRS', although interprelational results from the 
infrared interpreter indicated the presence of methyl and methylene group in the 
tinknown and in all three matches, the probability order for the " functional groups- 
identifies the match #1 as the correct identification. These results arc listed in 
Table 2. By comparing the probabilities obtained for match #1 with those obtained 
for matches #2 and #3 one can conclude that only match #1 is the correct answer. 
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Figure 3. Library searching results for the lest Sample using tlie CO method The 
close resemblance between the unknown and the three matching spectra compli- 
cates the lask of positive identification requiring. 






CONCLUSIONS 

A turn-key system has been developed to utilize artificial intelligence and the 
positive identification capability of the library searching. The CO algorithm which 
combines results from five different library searching routines can obtain high 
quality library searching data. The typical analysis time of this tum-key system, 
including the data entry, the library searching and the final PAIRS' interpretation, 
is about five minutes per sample. PC PAIRS* not only increases the sample 
throughput of routine sample identification in the laboratory but also standardizes 
the procedure of infrared data interpretation. Current system is with an enhanced 
user interface and takes advantage of the ready access and user friendly nature of 
PC computer systems, therefore, can be an invaluable training system in the labo- 
ratory. 
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Table 1 . Typical PC PAIRS+ Library Searching Report 

The simple spectrum is: lOSOL 1520 

The reference library l«: »>» The Oil Library ««< 
The library Se»rch Result 



* Match* I wilh 

File* 35: 

* Match #2 wilh 

File* 19 

* Mmch»3 wilh 

Hie* 21 

* Maich #4 with 

File # 32 

* Much #5 wilh 

File # i. 



• Malch #1 wilh 

File* 16. 

• Match *2 wilh 

File* 32 

• Match #3 with 

File* 21 

• Match *4 with 

File* 53 

• Malch »5 with 

File* 27 



( The FT Method ) 
probability of »»»» 0.02O43O ««««: 
IOSOL 1520, TRACE ORGANICS 

probability of »»»» S-«*?' ^IISJU 
F.SSO REGULAR GASOLINE. TRACE ORGANICS 
probability of »»»» 0.026579 ««««: 

: ISOPAR -M\ TRACE ORGANICS 
probability of »»»» 0.029905 «««« 

lSOPAR^K, TRACE ORGANICS 
probability of »»»» °J? 5 »" ££S <<<<: 

DISTILLED ISOPAR H, TRACE OROANICS 

The Library Search Result 

( The AB Method ) 
probability of »»»» 27.904972 ««««: 

IOSOL 1125. TRACE ORGANICS 

probability of »»»» 30.336798 ««««: 

ISOPAR'lC, TRACE ORGANICS^ 
probability of »»»» 30.400099 ««««: 

ISOPAR 'M', TRACE ORGANICS 
probability of »»»» 32.325081 ««««: 

TURBO FUEL, TRACE ORGANICS 

probability of »»»» 32.441692 ««««: 

IOSOL 2024, TRACE ORGANICS 



* Malch *l with 

File # 35 

* Mitch #2 wilh 

File* 32 

* Malch *3 with 

File* 21 

* Match #4 wilh 

File* 16 

* Match #5 wilh 

File* 24 



The Library Search Result 

( TheADMeihod ) 
probability of »»»» 2.182141 ««««: 

IOSOL 1 520, TRACE ORGANICS 
ratability of >>>>>>>> 3.216939 ««««: 

ISOPAr'k'. TRACE ORGANICS 

probability of »»»» 3.284418 ««««. 

ISOPAR 'M. TRACE ORGANICS 
probability of »»»» 4. 126999 ««««: 

IOSOL 1125. TRACE ORGANICS 

probability of »»»» 4.243579 ««««: 

VARSOL 3135, TRACE ORGANICS 






The Library Search Result 
( The SD Method ) 




7) AROMATIC °-°° 

8JAROM-MONOSUBST 0.00 

9) Zl 

IO)AROM-1.2^UBST 0.00 

Interpretaiion for library match # 2 

Spiral data summary for: Kcrosinc. TRACE ORGANICS 
The solvent is: neat 



15JPYRIMIDYL 

16) PURINE 

17) INDOLE 

18) PYRROLE 

19) FURAN 

20) THIOFHENE 



0.01 
0.01 
0.01 
0.00 
000 
0.00 






No empirical formula given 



There are 



Line 

1 

2 

I 

J 

6 
7 

K 

9 



Width 
sharp 
sharp 
sharp 
sharp 
sharp 
sharp 
sharp 
sharp 
sharp 

10 sharp 

11 sharp 

12 sharp 

1 3 sharp 

14 sharp 

15 sharp 

16 sharp 

17 sharp 

18 sharp 

19 sharp 

20 average 

21 average 

22 average 

23 broad 

24 average 



24 peaks 
Intensity 
1 



1 

1 

! 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

I 

1 

1 

1 

3 
4 
S 

ft 

7 
1(1 



location 
729 
739 
771 
918 
964 
980 
1007 
1043 
1055 
1063 
1066 
1074 
1090 
1097 
1124 
1151 
1169 
1340 
1365 
1379 
1464 
2873 
2927 
2956 



Printout is sorted by probability: 

Group name Probability 

1) METHYL 

2) ACID 

3) METHYL GEMD1 

4) KETAL 

5) AMINE 

6) AMINE TERTIARY 

7) ACETAL 

8) NH3+ 

9) MERCAPTAN 

10) METHYLENE 
U) AROMATIC 
12)HETEROAROMATlC 
13)PYRIDYL 
14) PYRAZYL 



0.88 

0.75 

0.55 

0.45 

0.40 

0.40 

0.40 

0.25 

0.25 

0.22 

0.05 

0.01 

0.01 

0.01 



Interpretation for library match # 3 

Spectral data summary for; ESSO Stove Oil. TRACE ORGANICS 

The solvent is: neat 

No empirical formula given 

There are 22 peaks 

Line Width Intensity 

1 average 

2 sharp 

3 sharp 

4 sharp 

5 sharp 

6 sharp 

7 sharp 
« sharp 

9 sharp 

10 sharp 

1 1 sharp 

12 sharp 

13 sharp 

14 sharp 

15 sharp 

16 sharp 

17 sharp 

18 average 

19 average 

20 average 

21 sharp 

22 average 10 



Location 
739 
773 

964 

1007 

1032 

1038 

1045 

1055 

1059 

1068 

1082 

1103 

1122 

1151 

1169 

1365 

1379 

1460 

2871 

2927 

2956 



Printout is sorted by probability: 

Group name Probability 

1) METHYL 0.93 

2)ACETAL 0.R4 

3) METHYL-GEMDI 0-55 

4) KETAL 0.52 

5) METHYLENE 0.49 

6) AMINE 0.40 
7)AMINE-TERT1ARY 0.40 

8) AROMATIC 0.05 

9) ACID 005 

1 0) I IETEROAROM ATIC 0.0 1 
11JPYR1DYL 0.01 
12) PYRAZYL 0.01 
!3)PYRIMIDYL 0.01 

14) PURINE 0.01 

15) INDOLE 001 



686 



Illllllll 

(7288) 
TD/5/T43 



